Smart Paper Notes

AutoSNN: Towards Energy-Efficient Spiking Neural Networks

Byunggook Na , Jisoo Mok , Seongsik Park , Dongjin Lee , Hyeokjun Choe , Sungroh Yoon

Categories: Neural and Evolutionary Computing | Artificial Intelligence | Machine Learning

2022-01-30

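Spiking neural networks (SNNs), which mimic information transmission in the brain, can process spatio-temporal information through discrete and sparse spikes and have therefore received considerable attention. To improve the accuracy and energy efficiency of SNNs, most previous studies have focused solely on training methods, and the effect of architecture has rarely been studied. We investigate the design choices used in previous studies in terms of accuracy and number of spikes and find that they are not best suited to SNNs. To further improve accuracy and reduce the spikes generated by SNNs, we propose a spike-aware neural architecture search framework called AutoSNN. We define a search space consisting of architectures without undesirable design choices. To enable spike-aware architecture search, we introduce a fitness that considers both accuracy and the number of spikes. AutoSNN successfully searches for SNN architectures that outperform hand-crafted SNNs in both accuracy and energy efficiency. We thoroughly demonstrate the effectiveness of AutoSNN on various datasets, including neuromorphic datasets.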
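The abstract says only that the fitness considers both accuracy and spike count; it does not give the formula. As a rough sketch of how such a fitness could rank candidates, the code below assumes a multiplicative penalty, accuracy × (spikes / target)^λ with λ < 0. The function name, the `target_spikes` and `lam` parameters, and the candidate numbers are all hypothetical and are not taken from the paper.

```python
def spike_aware_fitness(accuracy, num_spikes, target_spikes, lam=-0.1):
    """Hypothetical fitness: accuracy scaled by a spike-count penalty.

    With lam < 0, an architecture that emits more spikes than the target
    is penalised, and one that emits fewer is rewarded.
    """
    if num_spikes <= 0 or target_spikes <= 0:
        raise ValueError("spike counts must be positive")
    return accuracy * (num_spikes / target_spikes) ** lam


# Illustrative candidates only: (validation accuracy, spikes per sample).
candidates = {
    "arch_a": (0.92, 200_000),  # slightly more accurate, twice the spikes
    "arch_b": (0.91, 100_000),
}

best = max(
    candidates,
    key=lambda name: spike_aware_fitness(*candidates[name], target_spikes=100_000),
)
print(best)
```

Under these assumed numbers the search would prefer `arch_b`: the one-point accuracy gain of `arch_a` does not offset doubling the spike count. How strongly spikes are penalised depends entirely on the assumed `lam`.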

Translated by Google Translate

Attention-Aware Anime Line Drawing Colorization

Yu Cao , Hao Tian , P. Y. Mok

Categories: Computer Vision | Artificial Intelligence

2022-12-21

Automatic colorization of anime line drawing has attracted much attention in recent years since it can substantially benefit the animation industry. User-hint based methods are the mainstream approach for line drawing colorization, while reference-based methods offer a more intuitive approach. Nevertheless, although reference-based methods can improve feature aggregation of the reference image and the line drawing, the colorization results are not compelling in terms of color consistency or semantic correspondence. In this paper, we introduce an attention-based model for anime line drawing colorization, in which a channel-wise and spatial-wise Convolutional Attention module is used to improve the ability of the encoder for feature extraction and key area perception, and a Stop-Gradient Attention module with cross-attention and self-attention is used to tackle the cross-domain long-range dependency problem. Extensive experiments show that our method outperforms other SOTA methods, with more accurate line structure and semantic color information.


Biomedical image analysis competitions: The state of current participation practice

Matthias Eisenmann , Annika Reinke , Vivienn Weru , Minu Dietlinde Tizabi , Fabian Isensee , Tim J. Adler , Patrick Godau , Veronika Cheplygina , Michal Kozubek , Sharib Ali

Categories: Computer Vision | Machine Learning

2022-12-16

The number of international benchmarking competitions is steadily increasing in various fields of machine learning (ML) research and practice. So far, however, little is known about the common practice as well as bottlenecks faced by the community in tackling the research questions posed. To shed light on the status quo of algorithm development in the specific field of biomedical imaging analysis, we designed an international survey that was issued to all participants of challenges conducted in conjunction with the IEEE ISBI 2021 and MICCAI 2021 conferences (80 competitions in total). The survey covered participants' expertise and working environments, their chosen strategies, as well as algorithm characteristics. A median of 72% of challenge participants took part in the survey. According to our results, knowledge exchange was the primary incentive (70%) for participation, while the reception of prize money played only a minor role (16%). While a median of 80 working hours was spent on method development, a large portion of participants stated that they did not have enough time for method development (32%). 25% perceived the infrastructure to be a bottleneck. Overall, 94% of all solutions were deep learning-based. Of these, 84% were based on standard architectures. 43% of the respondents reported that the data samples (e.g., images) were too large to be processed at once. This was most commonly addressed by patch-based training (69%), downsampling (37%), and solving 3D analysis tasks as a series of 2D tasks. K-fold cross-validation on the training set was performed by only 37% of the participants and only 50% of the participants performed ensembling based on multiple identical models (61%) or heterogeneous models (39%). 48% of the respondents applied postprocessing steps.


Comparative Validation of AI and non-AI Methods in MRI Volumetry to Diagnose Parkinsonian Syndromes

Joomee Song , Juyoung Hahm , Jisoo Lee , Chae Yeon Lim , Myung Jin Chung , Jinyoung Youn , Jin Whan Cho , Jong Hyeon Ahn , Kyung-Su Kim

Categories: Artificial Intelligence

2022-07-23

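Automated segmentation and volumetry of brain magnetic resonance imaging (MRI) scans are essential for diagnosing Parkinson's disease (PD) and Parkinson's plus syndromes (P-plus). To improve diagnostic performance, we adopted deep learning (DL) models for brain segmentation and compared their performance with the gold-standard non-DL method. We collected brain MRI scans of healthy controls (n = 105) and patients with PD (n = 105), multiple system atrophy (n = 132), and progressive supranuclear palsy (n = 69) [...] 2020. Using the gold-standard non-DL model, FreeSurfer (FS), we segmented six brain structures (midbrain, pons, caudate, putamen, pallidum, and third ventricle) and used the results as annotation data for the DL models, the representative V-Net and UNETR. We calculated the Dice scores and the area under the curve (AUC) for differentiating normal, PD, and P-plus cases. The segmentation times of V-Net and UNETR for the six brain structures per patient were 3.48 ± 0.17 s and 48.14 ± 0.97 s, respectively, at least 300 times faster than FS (15,735 ± 1.07 s). The Dice scores of both DL models were sufficiently high (> 0.85), and their AUCs for disease classification were superior to those of FS. For classifying normal vs. P-plus and PD vs. multiple system atrophy (cerebellar type), the DL models and FS showed AUCs above 0.8. DL significantly reduces analysis time without compromising the performance of brain segmentation and differential diagnosis. Our findings may contribute to the adoption of DL brain MRI segmentation in clinical settings and advance brain research.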
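For readers unfamiliar with the Dice score reported here, the following is a minimal sketch of the standard definition, Dice = 2|A ∩ B| / (|A| + |B|), for two binary masks. The flat 0/1 lists and the toy masks are illustrative assumptions; the study's actual evaluation code is not given in the abstract.

```python
def dice_score(pred, target):
    """Dice overlap between two binary masks given as flat 0/1 sequences."""
    if len(pred) != len(target):
        raise ValueError("masks must have the same number of voxels")
    intersection = sum(1 for p, t in zip(pred, target) if p and t)
    total = sum(1 for p in pred if p) + sum(1 for t in target if t)
    # Convention assumed here: two empty masks count as a perfect match.
    return 1.0 if total == 0 else 2.0 * intersection / total


# Toy example: a DL prediction vs. a reference segmentation of 6 voxels.
pred = [1, 1, 0, 0, 1, 0]
ref = [1, 0, 0, 0, 1, 1]
print(round(dice_score(pred, ref), 3))  # 2 * 2 / (3 + 3) = 0.667
```

A score of 1.0 means the two masks coincide exactly, so the reported threshold of 0.85 indicates close but not perfect agreement with the FreeSurfer annotations.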


Unsupervised Deformable Image Registration with Absent Correspondences in Pre-operative and Post-Recurrence Brain Tumor MRI Scans

Tony C. W. Mok , Albert C. S. Chung

Categories: Computer Vision

2022-06-08

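Registration of pre-operative and post-recurrence brain images is often needed to evaluate the effectiveness of brain glioma treatment. Although recent deep learning-based deformable registration methods have achieved remarkable success on healthy brain images, most of them cannot handle images with pathologies because correspondences are missing in the reference image. In this paper, we propose a deep learning-based deformable registration method that jointly estimates the regions with absent correspondence and the bidirectional deformation fields. A forward-backward consistency constraint is used to help localize the resection and recurrence regions from voxels that lack correspondence between the two images. Results on 3D clinical data from the BraTS-Reg challenge show that our method improves image alignment compared with traditional and deep learning-based registration methods, with or without a cost function masking strategy. The source code is available at https://github.com/cwmok/dirac.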
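The idea behind a forward-backward consistency check can be sketched in one dimension: map a position forward, map the result back, and flag positions that do not return close to where they started. The code below is a simplified illustration with integer displacements on a 1D grid. The function names, the clamping at the grid border, and the threshold are assumptions, not the authors' implementation.

```python
def forward_backward_error(fwd, bwd):
    """Per-position round-trip error for two 1D integer displacement fields.

    fwd[x] moves position x in image A to x + fwd[x] in image B;
    bwd[y] moves position y in image B to y + bwd[y] in image A.
    A consistent pair satisfies fwd[x] + bwd[x + fwd[x]] == 0.
    """
    n = len(fwd)
    errors = []
    for x in range(n):
        y = min(max(x + fwd[x], 0), n - 1)  # clamp to the grid (assumption)
        errors.append(abs(fwd[x] + bwd[y]))
    return errors


def absent_correspondence_mask(fwd, bwd, threshold):
    """Flag positions whose round-trip error exceeds the threshold."""
    return [e > threshold for e in forward_backward_error(fwd, bwd)]


fwd = [1, 1, 1, 0, 0]
bwd = [0, -1, -1, -1, 3]
errors = forward_backward_error(fwd, bwd)
mask = absent_correspondence_mask(fwd, bwd, threshold=1)
print(errors)  # [0, 0, 0, 1, 3]
print(mask)    # [False, False, False, False, True]
```

In this toy example only the last position fails the round trip, which is the kind of voxel such a constraint would treat as lacking a counterpart in the other image.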


Learn2Reg: comprehensive multi-task medical image registration challenge, dataset and evaluation in the era of deep learning

Alessa Hering , Lasse Hansen , Tony C. W. Mok , Albert C. S. Chung , Hanna Siebert , Stephanie Häger , Annkristin Lange , Sven Kuckertz , Stefan Heldmann , Wei Shao

Categories: Computer Vision

2021-12-08

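To date, few studies have comprehensively compared medical image registration approaches on a wide range of complementary, clinically relevant tasks. This limits the adoption of research advances into practice and prevents fair benchmarking across competing approaches. Many new learning-based methods have been explored within the last five years, but the question of which optimization, architectural, or metric strategy is best suited remains open. Learn2Reg covers a wide range of anatomies (brain, abdomen, and thorax), modalities (ultrasound, CT, MRI), populations (intra- and inter-patient), and levels of supervision. We established a lower entry barrier for training and validation of 3D registration, which helped us compile results from over 65 individual method submissions by more than 20 unique teams. Our complementary set of metrics, including robustness, accuracy, plausibility, and speed, provides unique insight into the current state of the art of medical image registration. Further analyses of transferability, bias, and the importance of supervision question the superiority of primarily deep learning-based approaches and open new research directions into hybrid methods that leverage GPU-accelerated conventional optimization.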


MUM : Mix Image Tiles and UnMix Feature Tiles for Semi-Supervised Object Detection

JongMok Kim , Jooyoung Jang , Seunghyeon Seo , Jisoo Jeong , Jongkeun Na , Nojun Kwak

Categories: Computer Vision

2021-11-22

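Many recent semi-supervised learning (SSL) studies build a teacher-student architecture and train the student network with the supervisory signal generated by the teacher. The data augmentation strategy plays an important role in the SSL framework, because it is hard to create a weak-strong augmented input pair without losing label information. In particular, when SSL is extended to semi-supervised object detection (SSOD), many strong augmentation methods related to image geometry and interpolation regularization are hard to use, because they may corrupt the location information of bounding boxes in the object detection task. To address this, we introduce a simple yet effective data augmentation method, Mix/UnMix (MUM), which unmixes feature tiles for mixed image tiles within the SSOD framework. Our proposed method mixes input image tiles and reconstructs them in the feature space. MUM can therefore enjoy the interpolation-regularization effect from non-interpolated pseudo-labels and successfully generate a meaningful weak-strong pair. Furthermore, MUM can easily be added on top of various SSOD methods. Extensive experiments on the MS-COCO and PASCAL VOC datasets demonstrate the superiority of MUM by consistently improving mAP over the baseline in all tested SSOD benchmark protocols.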
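The mix-then-unmix bookkeeping can be illustrated with a small sketch: tiles at each grid position are shuffled across the images of a batch, and the stored permutation is later inverted to put every tile back. This is a simplified illustration under assumptions. Tiles are opaque Python objects here, the function names are hypothetical, and in the method described above the unmixing is applied to feature tiles produced by the network rather than to the image tiles themselves.

```python
import random


def mix_tiles(batch, rng):
    """Shuffle tiles across images, independently for each tile position.

    batch[i][k] is tile k of image i. Returns the mixed batch and the
    permutation used at every tile position.
    """
    n_images, n_tiles = len(batch), len(batch[0])
    mixed = [[None] * n_tiles for _ in range(n_images)]
    perms = []
    for k in range(n_tiles):
        perm = list(range(n_images))
        rng.shuffle(perm)
        perms.append(perm)
        for i in range(n_images):
            # Mixed image i receives tile k from source image perm[i].
            mixed[i][k] = batch[perm[i]][k]
    return mixed, perms


def unmix_tiles(mixed, perms):
    """Invert mix_tiles: send every tile back to its source image."""
    n_images, n_tiles = len(mixed), len(mixed[0])
    restored = [[None] * n_tiles for _ in range(n_images)]
    for k, perm in enumerate(perms):
        for i in range(n_images):
            restored[perm[i]][k] = mixed[i][k]
    return restored


# Three images, each split into a 2 x 2 grid (4 tiles), labelled for clarity.
batch = [[f"img{i}_tile{k}" for k in range(4)] for i in range(3)]
mixed, perms = mix_tiles(batch, random.Random(0))
restored = unmix_tiles(mixed, perms)
print(restored == batch)  # True: unmixing recovers the original layout
```

Because tiles only swap between images at the same grid position, each tile keeps its spatial location, which is consistent with the abstract's concern about preserving bounding-box position information.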
